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RATE-DISTORTION VIDEO DATA PARTITIONING USING CONVEX HULL SEARCH 



5 The present invention relates generally to scalable video coding systems and more 

particularly to rate-distortion optimized data partitioning (RDDP) of discrete cosine transform 
(DCT) coefficients for video transmission. 

Video is a sequence of pictures. Each picture is formed by an array of pixels. The 
size of uncompressed video is huge and therefore video compression is often used to reduce 

10 the size and improve the data transmission rate. Various video coding methods (e.g., MPEG 
1, MPEG 2, and MPEG 4) have been established to provide an international standard for the 
coded representation of moving pictures and associated audio on digital storage media. 

Such video coding methods format and compress the raw video data for reduced rate 
transmission. For example, the format of the MPEG 2 standard consists of 4 layers: Group of 

IS Pictures, Pictures, Slice, Macroblock. A video sequence begins with a sequence header that 
includes one or more groups of pictures (GOP), and ends with an end-of-sequence code. The 
Group of Pictures (GOP) includes a header and a series of one of more pictures intended to 
allow random access into the video sequence. The MPEG 2 standard defines three types of 
pictures: Intra Pictures (I-Pictures) Predicted Pictures (P-Pictures); and Bidirectional Pictures 

20 (B-Pictures) which are combined to form a group of pictures. 

The pictures are the primary coding unit of a video sequence. A picture consists of 
three rectangular matrices representing luminance (Y) and two chrominance (Cb and Cr) 
values. The Y matrix has an even number of rows and columns. The Cb and Cr matrices are 
one-half the size of the Y matrix in each direction (horizontal and vertical). The slices are 

25 one or more "contiguous" macroblocks. The order of the macroblocks within a slice is from 
left-to-right and top-to-bottom. 

The macroblocks are the basic coding unit in the MPEG algorithm. The macroblock 
is a 16x16 pixel segment in a frame. Since each chrominance component has one-half the 
vertical and horizontal resolution of the luminance component, a macroblock consists of four 

30 Y, one Cr, and one Cb block. The block is the smallest coding unit in the MPEG algorithm. 
It consists of 8x8 pixels and can be one of three types: luminance (Y), red chrominance (Cr), 
or blue chrominance (Cb). The block is the basic unit in intra frame coding. 
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The MPEG transform coding algorithm includes the following coding steps: Discrete 
cosine transform (DCT), Quantization and Run-length encoding. 

An important technique in video coding is scalability. In this regard, a scalable video 
codec is defined as a codec that is capable of producing a bitstream that can be divided into 
5 embedded subsets. These subsets can be independently decoded to provide video sequences 
of increasing quality. Thus, a single compression operation can produce bitstreams with 
different rates and reconstructed quality. A small subset of the original bitstream can be 
initially transmitted to provide a base layer quality with extra layers subsequently transmitted 
as enhancement layers. Scalability is siq)ported by most of the video compression standards 

10 such as MPEG-2, MPEG-4 and H.263. 

An important application of scalability is in error resilient video transmission. 
Scalability can be used to apply stronger error protection to the base layer than to the 
enhancement layer(s) (i.e., unequal error protection). Thus, the base layer will be successfully 
decoded with high probability even during adverse transmission channel conditions. 

15 Data Partitioning (DP) is used in connection with the encoder to facilitate scalability. 

As such, a merging technique is used in connection with the decoder to merge the data to 
form the correct video images. 

With respect to data partitioning, for example in MPEG 2, the slice layer indicates the 
maximum number of block transform coefBcients contained in the particular bitstream 

20 (known as the priority break point). Data partitioning is a frequency domain method that 

breaks the block of 64 quantized transform coefficients into two bitstreams. The first, higher 
priority bitstream (e.g., base layer) contains the more critical lower frequency coefficients 
and side information (such as DC values, motion vectors). The second, lower priority 
bitstream (e.g., enhancement layers) carries higher frequency AC data. 

25 One technique for implementing data partitioning outside an encoder entails providing 

at the transmitter, a demultiplexer which receives from the variable length decoder (VLD) the 
nimiber of bits used for each variable length code and separates the bitstream based on the 
priority break point (PBP) value. Note that the PBP's can be changed at each slice based on 
the rate partitioning logic used. In conventional data-partitioning (DP) video coders (e.g., 

30 MPEG), a single layer bit stream is partitioned into two or more bit streams in the DCT 

domain. During transmission, one or more bit streams are sent to achieve bit rate scalability. 
Unequal error protection can be applied to base layer and enhancement layer data to improve 



wo 2005/029868 



PCT/IB2004/051811 



PCT/IB2004/051811 



PHUS030348WO 



the resistance to channel degradation. 

As to merging of the partitioned data outside the decoder, two VLD's may be used to 
process the base layer and enhancement layer streams and then output a nonlayered bitstream. 
The PBP value defines how an encoded bitstream is partitioned. Before decoding, depending 
5 on resource allocation and/or receiver capacity, the received bit-streams or a subset thereof 
are merged into one single bit-stream and decoded. 

The conventional DP structure has many advantages in the home network 
environment. More specifically, at its full quality, the rate-distortion performance of the DP 
is as good as its single layer counterpart while rate scalability is also allowed. The rate- 
10 distortion (R-D) performance is concerned with finding an optimal combination of rate and 
distortion. This optimal combination, which could also be seen as the optimal combination of 
cost and quality, is not unique. R-D schemes attempt to represent a piece of information with 
the fewest bits possible and at the same time in a way that will lead to the best reproduction 
quality. 

IS It is also noted that in the conventional DP structure, the additional decoding 

complexity overhead is very minimal at its full quality while the DP provides wider range of 
decoder complexity scalability. This is because variable length decoding (VLD) of DCT run- 
lengtti pairs, which is the most computational extensive part, now becomes scalable. 

In the conventional DP structure, the DCT priority break point (PBP) value needs to 
20 be transmitted explicitly as side information. To minimize overhead, the PBP value is usually 
fixed for all the DCT blocks within each slice or video packet. While the conventional DP is 
simple and has many advantages, there is little room for base layer optimization because only 
one PBP value is used for all blocks within each slice or video packet. 

While the conventional DP method is simple and has some advantages, it is not 
25 capable of adapting base layer optimization because only one PBP value is used for all blocks 
within each slice or video packet. 

Accordingly, there exists a need for video coding techniques that overcome the 
limitations of the conventional data partitioning scheme and provide improved base layer 
optimization. 

30 In tiie inventor's related disclosure entitled System and Method of Rate-Distortion 

Optimized Data Partition for Video Coding Using a Parametric Rate-Distortion Model 
assigned USSN 60/463,747 filed April 18. 2003; refiled July 29, 2003 and assigned USSN 



wo 2005/029868 PCT/IB2004/051811 

PCT/IB2004/051811 

4 

PHUS030348WO 



60/490,835 (corresponding to Applicant's Reference No. 703553), incorporated by reference 
herein in its entirety, a rate-distortion optimized data partitioning (RDDP) is described which 
provides a breakthrough for data partitioning by allowing the PBP value to adapt each at 
DCT block level with minimal overhead (-20 bits for each slice or video packet) by 
5 employing context-based backward adaptation. Such a block-by-block adaptation is always 
performed in a rate-distortion optimization scheme which guarantees that the RDDP achieves 
a nearly optimal video quality under certain convexity conditions on the rate-distortion (RD) 
planes. 

The RDDP is based on a Lagrangian optimization algorithm. A primary advantage of 
10 the Lagrangian approach for rate-distortion optimization is its independent property for each 
signal element. More specifically, the theoretical performance limit of the data partitioning 
can be achieved by minimizing the following cost function: 

min{D/*^+A/2,^*>} , / = !,... ,2 (1) 

h 

15 

where Z>/*^and R}^^ denote distortion and rate for the base layer of the i-th DCT block when 

the break point is h and Q denotes the number of total DCT blocks in each frame. The 
solution of the Lagrangian optimization problem (1) lies in the convex hull of the R-D points. 
Considering a typical convex R-D curve as shown in FIG. 1, the minimum 

20 Lagrangian function is achieved for that point which is "'hit" first by the plane wave of 
absolute slope A (S = - A) impinging on the rate-distortion curve. If every admissible 
operating point lies on the convex hull, then the absolute slope before the optimal operating 
point is greater than A , while the absolute slope after the optimal point is less than or equal to 
A. This implies that DCT run-level pairs for the convex R-D curve should satisfy the 

25 condition of: 



r >A , k^h, 

\ <A , k>h, ^ ^ 
where A is the Lagrangian multiplier or quality factor, iV* and C* denote the k-th DCT 
code length and level for the /-th DCT blocks, respectively, and hj denotes the optimal 
30 breakpoint value for the i-th DCT blocks. Since the values of C/ and are known for 
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both the encoder and decoder, a basic idea of RDDP is that instead of encoding and 
transmitting the optimal breakpoint value hf, only the quality factor X is encoded and 

transmitted to the decoder and then the decoder deduces the breakpoint hf from and Nf • 

It has been found that the RDDP algorithm using Eq. (2) is near optimal in the sense 
5 that only one more run, level pair is included into tiie base layer compared to the optimal one. 
This run, level pair is the point on the rate-distortion curve at which the slope turns from 
being greater than X to being lower than or equal to L 

In practice, R-D curves for the DCT blocks are often non-convex. In fliis case, the 
partitioning rule given by Eq. (2) is not necessarily valid and the optimality of RDDP is no 
10 longer assured. For example, for the non-convex R-D curve shown in FIG. 2, the optimal or 
priority break point (PBP) value should be while the RDDP algorithm provides a break 
point value of , which makes the base layer under-partitioned. 

Since the priority break point (PBP) value defines how an encoded bitstream is 
partitioned, Le., for decoding purposes, the received bitstreams are decoded based on the 
15 priority break pomt value, it is important to be able to have or determine the same priority 
break point (PBP) value for both encoding and decoding purposes. 

It is an object of the present invention to provide an improved rate-distortion 
optimized data partitioning technique and algorithm. It is another object of the present 
invention to provide a rate-distortion optimized data partitioning teclmique for video using 
20 backward adaptation. It is a further object of the present invention to provide a new rate- 
distortion optimized data partitioning (RDDP) technique which employs an incremental 
computation algorithm of convex hull and slopes which overcomes drawbacks of other 
RDDP algorithms. 

It is still another object of the present invention to provide a video coding technique 
25 which overcomes the limitations of the conventional data partitioning techniques and 
provides improved base layer optimization. 

In order to achieve these objects and others, in accordance with one form of the 
present invention, a method for partitioning video data into a base layer and at least one 
enhancement layer includes the steps of receiving video data and separating it into a plurality 
30 of fimnes which are further separated into a plurality of blocks, determining DCT coefficients 
for the blocks and for each block, quantizing the DCT coefficients, converting the quantized 
DCT coefficients of the base layer into a set of (run, length) pairs, and determining a 
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partitioning point by analyzing the slope of lines only between adjacent pairs of (run, length) 
pairs which lie on the convex hull. Once the partitioning point is determined, only those (run, 
length) pairs before and inclusive of the partitioning point are encoded for transmission in the 
base layer and those (run, length) pairs after the partitioning point are encoded for 
5 transmission in the enhancement layer(s). 

In one embodiment, the partitioning point is determined by analyzing the slope of 
lines only between adjacent pairs of (run, length) pairs which lie on a causally optimal 
convex hull such that the causally optimal convex hull can be determined synchronously 
upon encoding the (run, length) pairs and decoding the (nm, length) pairs. 

10 More specifically, in one exemplifying method for determining the partitioning point, 

the slope of lines between all adjacent pair of the (run, length) pairs are determined and a 
determination is made as to which of the (run, length) pairs lie on the causal convex hull 
based on the slope of the lines between the adjacent pairs of (run, length) pairs. The 
partitioning point is then determined based on the slope of the lines between the adjacent 

15 pairs of (run, length) pairs which lie on the causal convex hull. For example, the slopes of the 
lines between the (run, length) pairs which lie on the causal convex hull are compared 
relative to a quality factor common to all of the blocks in each frame. The quality factor may 
be placed in a header of the frame. In this maimer, the partitioning point for each block, 
which may vaxy for each block, is determined based on the slope of the lines between the 

20 aidjacent pairs of (run, length) pairs which lie on the causal convex hull and on a quality 
factor common for all blocks in a frame. 

Determining which pairs lie on the causal convex hull may entail determining a 
distortion-length slope between each pair in the set (except for the first and last) and a 
preceding pair and between that pair and a following pair and determining whether the 

25 distortion-length slope between that pair and the following pair is less than the distortion- 
length slope between that pair and the preceding pair, and if so, considering that pair to lie on 
the causal convex hull. A causal convex hull set is thus formed from the pairs determined to 
lie on the causal convex hull and the first pair in the (run, length) set. 

In accordance with another form of the present invention, a scalable video system 

30 includes a source encoder for encoding video data and outputting encoded data having a base 
layer and at least one enhancement layer. The encoder determines DCT coefficients for a 
plurality of blocks of a video frame to form a base layer and at least one enhancement layer. 
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and for each block, quantizes the DCT coefficients, converts the quantized DCT coefficients 
of the base layer into a set of (run, length) pairs, and determines a partitioning point by 
analyzing the slope of lines only between adjacent pairs of (run, length) pairs which lie on the 
convex hull. The encoder then encodes only those (run, length) pairs before and inclusive of 
5 the partitioning point into a transmission of the base layer and encodes those (run, length) 
pairs after the partitioning point into a transmission of the enhancement layer(s). More 
specifically, the encoder can be designed to determine the partitioning point by determining 
the slope of lines between all adjacent pairs of the (run, length) pairs, determining which of 
the (run, length) pairs lie on a causal convex hull based on the slope of the lines between the 
10 adjacent pairs of (run, length) pairs, and then determining the partitioning point based on the 
slope of the lines between the adjacent pairs of (run, length) pairs which lie on the causal 
convex hull. 

The video system can also include a source decoder for decoding video data having 
the base layer and at least one enhancement layer and outputting decoded data. The decoder 
1 5 decodes the video data based on a partitioning point determined fit)m the causal (run, length) 

pairs in Hhe base layer and the enhancement layer. > . 

The invention, together with further objects and advantages thereof, may best be 
understood by reference to the following description taken in conjunction with the 
accompanying drawings, wherein like reference numerals identify like elements and wherein: . ' . 
20 FIG. 1 is an example of a convex rate-distortion (R-D) curve; 

FIG. 2 shows a non-convex R-D curve for which the application of another RDDP 
technique would not provide an optimal breakpoint value but for which the embodiment of 
the present invention can be applied; 

FIG. 3 is a flow chart showing the steps in a method for processing video data in 
25 accordance with the invention; 

FIG. 4 shows a convex hull formed by truncation points for a DCT block in which the 
algorithm in accordance with the invention is applied; and 

FIG. 5 is a schematic of a video system capable of applying the techniques in 
accordance with the invention. 
30 The invention is applicable in a scalable video system with layered coding and 

transport prioritization in which a layered source encoder encodes input video data and a 
layered source decoder decodes the encoded data. The output of the source encoder includes 
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a base layer and one or more enhancement layers. A plurality of channels cany the output 
encoded data. 

There are different ways of implementing layered coding. For example, in temporal 
domain layered coding, the base layer contains a bit stream with a lower frame rate and the 
5 enhancement layers contain incremental information to obtain an output with higher frame 
rates. In spatial domain layered coding, the base layer codes the sub-sampled version of the 
original video sequence and the enhancement layers contain additional information for 
obtaining higher spatial resolution at the decoder. Generally, a different layer uses a different 
data stream and has distinctly different tolerances to channel errors. To combat channel 

10 errors, layered coding is usually combined with transport prioritization so that the base layer 
is delivered with a higher degree of error protection. If the base layer is lost, the data 
contained in the enhancement layers may be useless. 

The video quality of the base layer may be flexibly controlled at the DCT block level. 
The desired base layer can be controlled by adapting the PBP value at the DCT block level by 

15 employing parametric RD model to approximate the convex hull of the RD planes for each 
DCT blocks, thereby finding the optimal partitioning points synchronously at the encoder and 
decoder. 

DCT is used to reduce the spatial correlation between adjacent error pixels, and to 
compact the energy of the error pixels into a few coefficients. Since many high frequency 

20 coefficients are zero after quantization, variable length coding (VLC) is accomplished by a 
run-length coding method, which orders the coefficients into a one-dimensional array using a 
so-called zig-zag scan so that the low-frequency coefficients are put in front of the high- 
frequency coefficients. This way, the quantized coefficients are specified in terms of the non- 
zero values and the number of the preceding zeros. Different symbols, each corresponding to 

25 a pair of zero run-length, and non-zero value, are coded using variable length codewords. 

The scalable video system may use entropy coding in which quantized DCT 
coefficients are rearranged into a one-dimensional array by scanning them in a zig-zag order. 
This rearrangement puts the DC coefficient at the first location of the array and the remaining 
AC coefficients are arranged from the low to high frequency, in both the horizontal and 

30 vertical directions. The assumption is that the quantized DCT coefficients at higher 
frequencies would likely be zero, thereby separating the non-zero and zero parts. The 
rearranged array is coded into a sequence of die run-level pair. The run is defined as the 
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distance between two non-zero coefficients in the array. The level is the non-zero value 
immediately following a sequence of zeros. This coding method produces a compact 
representation of the 8x8 DCT coefficients, since a large number of the coefficients have 
been already quantized to zero value. 
5 The run-level pairs and the information about the macroblock, such as the motion 

vectors, and prediction types, are further compressed using entropy coding. Both variable- 
length and fixed-length codes are used for this purpose. 

The design of the video system is motivated by the operational rate-distortion (RD) 
theory. RD theory is useful in coding and compression scenarios, where the available 

10 bandwidth is known a priori and where the purpose is to achieve the best reproduction quality 
that can be achieved within this bandwidth (i.e,, adaptive algorithms). 

Referring now to FIG. 3, in accordance with the present invention, an incremental 
computation algorithm is employed for convex hull and slope R-D curves such as shown in 
FIG. 2. The incremental algorithm computes the convex hull and R-D slope for each DCT 

15 block of each video frame using preceding run-length variable length coder (VLC) pairs in a 
computationally efficient manner. The computation of the convex hull is causal-optimal in 
tiie sense that the computed convex hull is the true convex hull for the given causal pairs of 
(run, length) pairs. Therefore, the same convex hull and R-D slope can be computed 
synchronously at the encoder and decoder. 

20 Generally, for each DCT block of a video frame, the DCT coefficients are quantized 

and converted into a set of (run, length) pairs (step 10). Each (run, length) pair is represented 
by (L{*\Z>/*0 as shown in FIG. 4. The slope of the lines between each adjacent pair of (run, 
length) pairs is then determined (step 12). For example, the slope between the initial (run, 
length) pair (designated 0) and the second (run, length) pair (designated 1), the slope between 

25 the initial (run, length) pair (designated 0) and the second (run, length) pair (designated 1), 
etc., are determined. 

Once the slope between each adjacent pair of (run, length) pairs is determined, a 
determination is made as to which (run, length) pairs lie on the convex hull (step 14). 
Encoding and decoding of the block of the video frame is based on the determined slopes of 
30 the line. 

This technique will be illustrated with reference to FIG. 4 wherein the R-D pairs of 
the-(run, length) of the i-th DCT block are shown and (X5*\D/*0 denotes the rate-distortion 



wo 2005/029868 



PCT/IB2004/051811 



PCT/IB2004/051811 

10 



PHUS030348WO 



pairs of the base layer including up to k (run, length) pairs, anid hf denotes the p-th rate- 
distortion pairs on the convex hull. The convex hull slope (designated S) which equals 
- ^lihf) denotes the "distortion-length" slope at hf . 

As shown in FIG. 4, some of the rate-distortion pairs do not lie on the convex hull. 
5 Namely, only 5 (run, length) pairs, (4*\Z)/*^) for k = 0, 2, 4, 7 and 9, lie on the convex hull. 

The solution for the optimization problem, the minimization of the cost function, Eq. (1), will 
be among those five rate-distortion pairs, i.e., h € {0,2,4,7,9} . Thus, if we have all the access 
of the rate-distortion pairs, only these rate-distortion pairs will be used to determine the 
partitioning slope, between the base layer and the enhancement layer. In order to find the 
10 feasible points, tiie convex hull and the resultant distortion-length slopes are computed. An 
exemplifying fast incremental computation algorithm of the convex hull and distortion-length 
slope is given as follows: 

Set A,(0) <- oo,i/, <- {0} and A*^ <- 0, 

15 Forz=l,2,...,Zi 

{ // for each rate-distortion pair 

Set AZ> Z>/*^> -i>/^> and AI <- L^p -^if^^ ; 
If AD>0 

{While AD>X,ih'^)AL 

20 {Set Hf ir- Hf\{h^} //exclude last elements of current 

convex hull set 
Set A'"' «— max //g&t last element in new convex hull 
set 

Set AD ^ -D/-'> and AL Zj'> -Z{**"> } 
25 Set/i*^<-z 

Set Hi<r'H,u{h''^} 
Set A(A'^)^AD/AZ:}} 
In the above algorithm, denotes the convex hull set, which is continuously being 
updated as more rate-distortion pairs are processed. In the data-partitioning problem, AD and 
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AL can be easily computed as follows: 



5 where Cf^Nf denotes the de-quantized DCT coefficient and code length of the k-th DCT 
(run, length) pairs. 

Once the (run, length) pairs on the convex hull are determined, the partitioning point 
for each block is determined based on the quality factor 8 (which is the same for all blocks in 
the same frame) and the slope of the lines between the adjacent pairs of (run, length) pairs on 

10 the convex hull (step 16). 

The algorithm is not causal in the sense that all the rate-distortion pairs should be 
processed to construct the **true" convex hull and the distortion-length slope. Without side 
information, the decoder can only decide the partitioning points based on the causal rate- 
distortion pairs. Therefore, in a preferred embodiment, the above convex hull search 

15 algorithm is modified to use only causal rate-distortion or (run, length) pairs. By applying the 
algorithm described above and Eq. (1), the partitioning point can be obtained from the causal 
(run, length) pairs and those (run, length) pairs before the partitioning point are encoded into 
the base layer (regardless of whether they lie on the convex hull or not) while the (run, 
length) pairs after the partitioning point are encoded into the enhancement layer(s) (step 18) 

20 In this manner, the invention provides a new partitioning rule without requiring the 
transmission of side information based on causally optimal convex hull computation. 

At the decoder side, the decoder receives the transmitted base layer and enhancement 
layer(s) and based on the (run, length) pairs included in the base layer and enhancement 
layer, it calculates the slope of the lines between each adjacent pair of (run, length) pairs, 

25 determines which lie on the causal convex hull and then based on the quality factor 8, 
determines the partitioning point (step 20). Since the same algorithm to determine the 
partitioning point is used in both the encoder and decoder, the same partitioning point will be 
obtained. Although the calculation of the slope between the lines is required at both the 
encoder and decoder side, the advantage of avoiding the transmission of side information is 

30 maintained. 

With respect to the partitioning between base and enhancement layer, the proposed 
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algorithm is given in the following manner: 
ALGORITHM: ENCODER 
Encode quality factor quality factor X into base layer. 
Set A,(0) oo,if, <- {0} and A'^ ^ 0. 

5 Forzr=l,2,...,Zi 

{ // for each run-length pairs 
Encode the z-th (runjength) pairs into base layer. 
Compute Cf' and N' . 

Set AD ir- XtC/]^ andALir- 
***** 

10 If AD > 0 

{While AZ>>A,(/i*^)AL 

{Set H,<-Hf\ {h'"* } //exclude last elements of current 

convex hull set 
Set h'^ <— max //get last element in new convex hull 
15 set 

Set AD<-AD + [C/''"f and AL AL + i\r/" } 
Set A^' <- z 
Set H, <r'HfKj{h'^} 

Set Z(h^)<r-AD/AL 
20 If A(A*'^)<A break.}} 

End 

Put the remaining (run, length) pairs into the enhancement layer. 

At the decoder side, the mei:ging algorithm is given as follows: 
25 ALGORITHM: DECODER 

Decode quality factor quality factor X fix)m base layer. 
Set A^O) <- oo^Hf <- {0} and /i*^ <- 0. 
Forz=l,2,..,,Zi 
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{ // for each run-length pairs 

Decode the z-th (run,length) pairs from the base layer. 
Compute C/ and Nf . 

SetAD<- Xl^*]' andAi<- ^4^1 

If AD>0 

{While AD>A,(/i*^)AL 

{Set Hi <- \ {A** } //exclude last elements of current 

convex hull set 
Set A'*** <— max Hi //get last element in new convex hull 
set 

Set AD^AD + EC*'"]' and AL ^ AL + T/f'"' } 

Set h'""' <r- z 
Set if, <-ir,u{/i^'} . 
Set A(A*"0<-AD/AL 
If X(h^)<X break.}} 

End 

Decode remaining (run, length) pairs from the enhancement layer. 

Note that the proposed algorithm is causally optimal in the sense that the resultant 
20 convex hull is the optimal convex hull given the causal (run, length) pairs. Hence, the 
decoder can also reconstruct the identical convex hull and furthermore, the identical 
partitioning points by comparing the quality factor A . 

FIG. 5 shows a scalable video system 22 capable of applying the algorithms described 
above. The scalable video system includes a scalable source encoder 24 capable of 
25 partitioning data into a base layer and at least one enhancement layer having data 
representing (run, length) pairs for a plurality of macroblocks in a video frame. The encoder 
24 includes a memory 26 which stores computer-executable process steps and a processor 28 
which executes the process steps stored in the memory 26 so as to determine a partitioning 
point This may be accomplished in the manner described above, for example, by analyzing 



10 
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the slope of lines only between adjacent pairs of (run, length) pairs which lie on a causal 
convex hull and include in the base layer only the (run, length) pairs before and inclusive of 
the partitioning point and include in the enhancement layer(s), the (run, length) pairs after the 
partitioning point The processor 28 can thus determine the partitioning point by determining 
5 the slope of lines between all adjacent pairs of the (run, length) pairs and determining which 
of the (run, length) pairs lie on the causal convex hull based on the slope of the lines between 
the adjacent pairs of (run, length) pairs. The partitioning point is then determined based on 
the slope of the lines between the adjacent pairs of (run, length) pairs which lie on the causal 
convex hull. 

10 The system 22 also includes a scalable decoder 30 capable of merging data from the 

base layer and the enhancement layer(s). The decoder 30 includes a memory 32 which stores 
computer-executable process steps and a processor 34 which executes the process steps 
stored in the memory 32 so as to receive the base layer and the enhancement layer(s) and 
determine a partitioning point based on the (run, length) pairs included in the base layer and 

15 in the enhancement layer(s) by analyzing only causal (run, length) pairs. 

Although illustrative embodiments of the present invention have been described 
herein with reference to the accompanying drawings, it is to be understood tiiat the invention 
is not limited to these precise embodiments, and that various other changes and modifications - 
may be effected therein by one of ordinary skill in the art without departing fix>m the scope or 

20 spirit of the invention* 



